Learning of Protein Interaction Network

نویسنده

  • Yanjun Qi
چکیده

Protein-protein interactions (PPI) play a key role in determining the outcome of most cellular processes. Correctly identifying and characterizing protein interactions and the networks they comprise is critical for understanding the molecular mechanisms within the cell. Large-scale biological experimental methods can directly and systematically detect the set of interacting proteins within an organism. Unfortunately, the resulting datasets are often incomplete and exhibit high false positive and false negative rates. In addition to the direct experimental data, a number of large biological datasets also provide indirect evidence about protein-interaction relationships. Thus computational approaches could be utilized to combine multiple information sources in order to predict the sets of interacting protein pairs and identify important biological substructures in this network. In this dissertation, we first carry out a systematic study of the efficacy of using supervised learning methods to integrate direct and indirect biological evidence for predicting pairwise protein interactions. The results indicate that the utility of information, the way the data is encoded as features, the target types of protein interactions and the computational approaches used are all significant for predicting such interactions. We then propose four learning algorithms for deriving PPI networks from different perspectives. (I) A combined computational and experimental approach is proposed for predicting interaction partners of human membrane receptors. The random forest binary classifier is employed to determine if a potential receptor-human pair interacts or not. Biological feedback is used to optimize feature encoding and improve the accuracy of predictions. The resulting receptor PPI network is then analyzed through graph property analysis, graph module identification and protein-family related network pattern search. Several novel predictions are further experimentally validated. Our proposed framework shows that focusing on specific subnetworks generates better predictions. The predicted network provides the most reliable dataset on the network of interactions involving human membrane receptors to date. (II) Considering that PPI networks are highly sparse graphs and there is no large negative reference set (non-interacting pairs) available, we design a ranking approach to identify

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks

Background: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from pro...

متن کامل

Construction and Analysis of Tissue-Specific Protein-Protein Interaction Networks in Humans

We have studied the changes in protein-protein interaction network of 38 different tissues of the human body. 123 gene expression samples from these tissues were used to construct human protein-protein interaction network. This network is then pruned using the gene expression samples of each tissue to construct different protein-protein interaction networks corresponding to different studied ti...

متن کامل

Study of PKA binding sites in cAMP-signaling pathway using structural protein-protein interaction networks

Backgroud: Protein-protein interaction, plays a key role in signal transduction in signaling pathways. Different approaches are used for prediction of these interactions including experimental and computational approaches. In conventional node-edge protein-protein interaction networks, we can only see which proteins interact but ‘structural networks’ show us how these proteins inter...

متن کامل

Evaluation of Protein Complexes in Muscular Atrophy Using Interaction Map Analysis

Background and purpose: Muscular atrophy is a condition derived from different diseases and aging. Molecular study of the disease condition can help in developing diagnostic methods and treatment approaches. In this study, protein interaction network was analyzed to understand molecular events at protein levels. Materials and methods: In this experimental study, the network was constructed and...

متن کامل

Prediction of Coffee Effects in Rats with Healthy and NAFLD Conditions Based on Protein-Protein Interaction Network Analysis

Background and objectives: Non-alcoholic fatty liver disease (NAFLD) is a common liver condition. On the other hand, coffee consumption has shown promising for gastrointestinal diseases.  Detection of the most valuable biomarkers of decaffeinated coffee treatment in healthy and non-alcoholic fatty liver disease conditions was the aim of the present study. Methods:</stro...

متن کامل

Using the Protein-protein Interaction Network to Identifying the Biomarkers in Evolution of the Oocyte

Background Oocyte maturity includes nuclear and cytoplasmic maturity, both of which are important for embryo fertilization. The development of oocyte is not limited to the period of follicular growth, and starts from the embryonic period and continues throughout life. In this study, for the purpose of evaluating the effect of the FSH hormone on the expression of genes, GEO access codes for this...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008